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LONG-TERM  GOALS 

This  project  is  a  collaborative  effort  with  C.  Jenkins  at  the  Univ.  Colorado.  The  long-term  goals  are  to: 
(1)  advance  the  understanding  of  the  spatial  variability  of  seabed  properties  as  a  function  of  geologic 
environment;  (2)  develop  robust  means  of  interpolation  in  the  presence  of  uncertain  data,  (3)  provide 
for  the  estimation  of  uncertainty  in  the  interpolation  at  unsampled  locations,  and  enable  investigation 
of  optimal  survey  design  to  minimize  uncertainties;  and  (4)  publish  a  computational/database  structure 
capable  of  producing  seafloor  maps  of  wide  geographic  extent,  for  multidisciplinary  use  -  in  global 
change  issues,  defense,  engineering,  and  ecology. 

OBJECTIVES 

The  US  Geological  Survey,  in  collaboration  with  Chris  Jenkins  of  INSTAAR/Univ.  Colorado,  has 
recently  released  a  large  data  base  of  seabed  sedimentary  properties  in  US  coastal  and  shelfal  waters. 
Dubbed  “usSEABED”  (Figure  1;  Williams,  et  ah,  2003;  Reid  et  ah,  2005,  2006;  Buczkowski  et  ah, 
2006;  http://walrus.wr.usgs.gov/usseabed;  USGS  publications  DS  1 18  (Atlantic),  146  (Gulf  of  Mexico) 
and  182  (Pacific)),  the  data  base  is  a  compilation  of  available  records  of  sedimentary  data  from 
seafloor  samples,  cores,  and  visual  observations.  Over  120,000  independent  seafloor  mean  grain  size 
measurements  are  included,  along  with  many  other  data  types.  This  work  is  part  of  a  larger  effort  to 
develop  a  world-wide  data  base  (collectively,  the  “dbSEABED”  data  bases),  which  continues  to 
enlarge  every  year  with  additional  records  (http://instaar.colorado.edu/~jenkinsc/dbseabed). 

The  comprehensive  dbSEABED  data  bases  provide  a  new  and  unprecedented  opportunity  for 
advancing  Navy  interests  related  to  the  acoustic  response  of  the  seafloor:  (1)  they  will  enable  the 
creation  of  maps  of  seafloor  sedimentary  properties  over  areas  and  at  a  level  of  detail  previously 
unobtainable  with  single  campaign  efforts;  and  (2)  they  will  enable  investigation  of  the  variability  of 
sedimentary  properties,  which  is  a  critical  factor  in  assessing  uncertainty  in  acoustic  detection,  over  a 
wide  range  of  environmental  conditions.  However,  the  highly  heterogeneous  nature  of  these  records 
present  important  challenges  in  data  handling.  The  most  significant  issue  involves  the  measurements  of 
mean  grain  size  either  by  “extracted”  or  “parsed”  methods.  Extracted  measurements  are  derived  from 
analytic  methods  of  computing  the  grain  size  histogram,  such  as  sieves,  settling  tubes  or 
diffractometry.  Such  measurements  tend  to  be  very  precise.  Parsed  measurements  of  mean  grain  size 
are  derived  from  a  calibrated  conversion  of  a  word-based  description  of  sediments  (e.g.,  sand,  fine 
sand,  mud,  muddy  sand,  gravelly  sand,  etc.).  Although  less  precise,  such  measurements  provide  the 
only  comprehensive  coverage  in  many  regions.  The  extent  to  which  these  two  forms  of  measuring 
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mean  grain  size  are  compatible  is  a  critical  outstanding  and  a  consequent  objective  of  our  work. 
Observational  bias  represents  a  second  significant  issue.  Bias  is  most  prominently  manifest  in  the 
inclusion  or  exclusion  of  coarse  fraction  (e.g.,  shell  hash),  depending  on  the  needs  of  the  observer. 
Extracted  measurements  appear  more  likely  to  exclude  shells,  which  are  important  to  acoustic 
considerations. 


Figure  1.  Location  map  of  current  (2007)  usSEABED  data  coverage  (250,000  records),  color  coded 

by  mean  grain  size  (Williams,  et  al.,  2006). 


APPROACH 

We  formulated  a  methodology  for  statistical  analysis  of  randomly-located  marine  sediment  point  data, 
and  applied  it  to  the  U.S.  continental  shelf  portions  of  usSEABED  mean  grain  size  records.  The 
usSEABED  database,  like  many  modern,  large  environmental  datasets,  is  heterogeneous  and 
interdisciplinary.  We  statistically  test  the  database  as  a  source  of  mean  grain  size  data,  and  from  it 
provide  a  first  examination  of  regional  seafloor  sediment  variability  across  the  entire  US  continental 
shelf.  Data  derived  from  laboratory  analyses  (“extracted”)  and  from  word-based  descriptions 
(“parsed”)  are  treated  separately,  and  they  are  compared  statistically  and  deterministically.  Data 
records  are  selected  for  spatial  analysis  by  their  location  within  sample  regions:  polygonal  areas 
defined  in  ArcGIS  chosen  by  geography,  water  depth,  and  data  sufficiency  (Figure  2).  We  derive 
isotropic,  binned  semivariograms  from  the  data,  and  invert  these  for  estimates  of  noise  variance,  field 
variance,  and  decorrelation  distance  (Figure  3).  The  highly  erratic  nature  of  the  semivariograms  is  a 
result  both  of  the  random  locations  of  the  data  and  of  the  high  level  of  data  uncertainty  (noise).  This 
decorrelates  the  data  covariance  matrix  for  the  inversion,  and  largely  prevents  robust  estimation  of  the 
fractal  dimension. 
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Figure  2.  Location  of  usSEABED  records  within  the  mid-Atlantic  Bight,  color  coded  by  mean  grain 
size,  and  overlain  on  bathymetric  contours  (meters).  Sample  areas  defined  for  this  region  are 

indicated  by  green  polygons  with  yellow  boarders. 


3 


LAG,  km 


Figure  3.  A  binned  semivariogram  (solid)  derived  from  parsed  mean  grain  size  measurements  in  the 
0-20  m  depth  range  of  the  New  York  Bight  (Figure  2).  The  best-fit  von  Kdrman  model  with  noise 
spike  is  overlain  (dashed),  with  parameter  values  as  indicated.  The  fractal  dimension  of  the  model  is 

1.5,  which  corresponds  to  an  exponential  curve. 

WORK  COMPLETED 

Project  funding  is  now  complete.  Goffs  primary  tasks  for  this  project  were  (1)  developing  a  tool  for 
correcting  noisy  data  through  resampling  and  (2)  utilizing  the  usSEABED  database  to  explore  seabed 
variability  in  US  shelfal  waters.  The  first  task  was  completed  and  published  in  Goff  et  al.  (2006).  The 
second  task  was  completed  last  year  and  is  the  topic  of  an  article  under  review  (Goff  et  al.,  in  review). 
This  progress  report  will  focus  on  the  latter  task,  for  which  a  preliminary  account  was  given  in  last 
year’s  annual  progress  report. 

RESULTS 

As  primary  component  of  the  study  we  examined  the  suitability  of  the  aggregated  usSEABED  data 
collection  for  mapping  and  variability  analysis.  Our  quantitative  comparison  (Figure  4)  between  the 
parsed  and  extracted  forms  of  mean  grain  size  data  reveal  some  differences.  As  expected,  the  noise 
variance  tends  to  be  larger  for  the  parsed  records  (by  -0.2-1.0  (j)  ~),  which  reflects  a  higher  level  of 
uncertainty  in  the  measurements.  Greater  temporal  variability  (i.e.,  timing  of  sample  collection)  may 
also  be  important.  At  present,  temporal  information  cannot  be  extracted  from  the  usSEABED 
database,  but  it  is  likely  that  the  word-based  data  records  span  a  much  greater  range  in  dates.  Any 
temporal  effects  on  grain  size  measurements  (e.g.,  changes  in  sedimentary  conditions,  changes  in 
navigational  resolution)  will  presumably  factor  into  the  data  uncertainty.  Higher  levels  of  uncertainty 
in  the  parsed  measurements  might  also  be  related  to  the  likelihood  that  they  are  more  likely  to 
incorporate  a  wider  set  of  materials,  such  as  shells. 
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Figure  4.  Plots  of  extracted  versus  parsed  statistical  parameters  from  sample  areas  with  adequate 
coverage  of  both  types  of  data  records.  Dashed  line  indicates  1:1  correspondence.  Circled  symbols 
in  (b)  and  (c)  are  from  Gulf  of  Maine  sample  regions,  and  are  discussed  in  the  text.  Correlation 
coefficients  (p)  are  given  for  each  plot,  neglecting  the  outliers. 


In  general,  the  extracted  mean  grain  sizes  tend  to  exhibit  higher  (j)  values  (finer  grain  sizes),  ~0.5  (j)  on 
average,  and  lower  field  variance  relative  to  the  parsed  mean  grain  sizes.  Both  observations  might  be 
explained  by  a  tendency  for  grain  size  analysts  to  discard  the  very  coarsest  fraction  of  a  sediment, 
particularly  if  it  contains  shell  material.  These  differences  between  parsed  and  extracted  measurements 
are,  however,  somewhat  regionally  dependent,  and  it  is  not  possible  to  formulate  a  precise  universal 
conversion  factor  between  the  two.  Nevertheless,  if  sufficient  numbers  of  each  type  of  data  exist 
within  a  particular  sample  region,  it  should  be  possible  to  empirically  define  a  local  conversion  so  that 
the  two  types  of  data  can  be  used  together,  along  with  their  respective  uncertainties,  for  quantitative 
applications  such  as  mapping. 

Our  analysis  of  sample  regions  for  the  usSEABED  records  of  mean  grain  size  on  the  continental  shelf 
reveal  considerable  geographic  variability  in  the  estimated  parameters  of  average  (Figure  5)  field 
variance  (Figure  6)  and  decorrelation  distance  (Figure  7).  High  field  variances  and  short  decorrelation 
lengths  on  the  Florida  shelves  may  indicate  a  high  level  of  patchiness  due  to  shelly  material.  Very 
high  variances  in  the  Gulf  of  Maine  may  be  a  result  of  residual  glacio-fluvial  gravel  patches 
interspersed  with  fine-grained  sediments.  Elsewhere,  we  observe  a  fairly  strong  inverse  relationship 
between  the  average  mean  grain  size  and  the  field  variance  (expressed  as  a  positive  correlation  in  (j) 
units).  We  are  uncertain  as  to  the  cause  of  this  correlation. 
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Figure  5.  Sample  areas  defined  over  the  entire  usSEABED  database,  color  coded  by  the  average  of 
the  parsed  mean  grain  size  measurements  within  the  sample  area. 


Figure  6.  Sample  areas  color  coded  by  estimated  field  variance  of  mean  grain  size  measurements. 
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Figure  7.  Sample  areas  color  coded  by  estimated  decorrelation 
distance  of  mean  grain  size  measurements. 


Other  than  the  small  values  on  the  Florida  shelf,  the  estimated  decorrelation  length  scales  do  not 
present  coherent  geographical  relationships.  Unlike  the  results  of  Jenkins  and  Goff  (in  review)  for  the 
analysis  of  mean  grain  size  measurements  in  the  Adriatic  Sea,  we  do  not  find  evidence  on  the  U.S. 
Atlantic  margin  for  any  consistent  depth  relationship  for  this  parameter  (other  regions  were 
insufficiently  sampled  to  discriminate  sample  regions  based  on  water  depth).  We  believe  that  analysis 
of  more  sample  areas  from  a  greater  variety  of  settings  will  be  needed  to  decipher  the  primary 
influences  on  decorrelation  length  scale.  We  suggest  here  that  it  may  be  controlled  by  competing 
relationships  of  geologic  inputs  (e.g.,  sediment  facies),  which  probably  tend  toward  larger 
decorrelation  length  scales,  and  oceanographic  reworking,  which  probably  tends  toward  shorter  length 
scales  (e.g.,  bedforms). 

Comparison  of  extracted  versus  parsed  statistical  parameters  provides  evidence  that  the  noise  variance 
estimated  from  parsed  and  extracted  mean  grain  size  measurements  are  correlated.  Assuming  the  noise 
variance  is  related  only  to  the  data  uncertainty,  there  is  no  reason  to  expect  such  a  correlation, 
suggesting  that  noise  is  somehow  influenced  by  the  properties  of  the  field.  However,  no  such  evidence 
could  be  found  in  our  interparameter  comparisons.  To  explain  these  observations,  we  hypothesize  that 
a  very  short-scale  of  field  variability  exists  that  is  superimposed  on  the  larger  scale  of  variability  that 
we  discern  through  estimate  of  the  decorrelation  length  of  the  semivariogram,  and  that  the 
decorrelation  length  of  this  shorter  scale  variability  is  shorter  than  the  resolution  scale  of  the  sample 
data.  In  other  words,  the  portion  of  data  variability  that  we  identify  as  “noise”  includes  both  a  real 
field  component  and  a  data  uncertainty  component.  If  true,  then  we  cannot  directly  distinguish 
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between  the  two,  although  we  may  be  able  to  infer  the  field  component  if  we  are  able  to  postulate 
globally  constant  values  of  uncertainty  for  parsed  and  extracted  measurements.  More  data  analysis 
will  be  required  to  detennine  if  that  is  the  case. 

Our  example  using  the  Long  Island  shelf  data  (Figure  8)  shows  that  usSEABED  can  reliably  be 
utilized  for  creating  maps  of  seafloor  mean  grain  size  and  possibly  other  sediment  characteristics.  Due 
to  the  noisy  character  of  the  data,  some  sort  of  filtering  or  other  noise  reduction  algorithm  (e.g.,  Goff  et 
al.,  2006)  is  recommended  prior  to  map  generation.  To  combine  the  parsed  and  extracted 
measurements,  a  bias  correction  proxy  must  be  applied,  and  such  a  correction  should  be  evaluated 
individually  for  each  region  of  interest.  For  the  Long  Island  shelf  data,  a  simple  mean  correction  of  0.5 
(j)  was  found  to  be  suitable  because  the  semivariogram  statistics  were  otherwise  found  to  be  very 
similar  between  the  two  types  of  data.  Recognizing  that  coarse  content  is  excluded  from  many  analytic 
results,  we  applied  the  correction  by  subtracting  it  from  the  extracted  data.  Other  regions,  however, 
exhibit  significant  difference  in  both  the  mean  and  variance  of  parsed  versus  extracted  mean  grain  size 
values,  and  in  those  cases  a  more  complex  bias  correction  proxy  must  be  devised. 

IMPACT/APPLICATIONS 

This  project  could  provide  a  major  advance  in  marine  science,  a  set  of  reliable  methods  which 
transform  point-site  seabed  data  into  griddings  that  will  be  useful  across  oceanographic  disciplines, 
sediment  transport,  acoustics,  habitat,  wave-energy  generation.  Our  work  will  result  in  a  set  of 
software  tools  that  will  be  open  source,  and  available  for  inclusion  any  existing  software  packages. 
These  tools  could  be  of  importance  to  the  Navy,  particularly  in  dealing  with  areas  with  sparse  data, 
such  as  “denied”  areas.  In  particular,  an  understanding  of  the  relationship  between  environmental 
parameters,  geologic  setting  and  spatial  variability  could  provide  an  ability  to  predict  the  amount  and 
spatial  scales  of  seabed  variability  using  a  parameterized  semi-variogram  model.  This  functionality 
provides  a  basis  upon  which  to  predict  seabed  parameters  at  unsampled  locations,  and  to  assess  the 
uncertainty  in  that  prediction.  Such  an  understanding  will  have  important  implications  for  assessing 
acoustic  prediction  uncertainty.  Furthermore,  the  semi-variogram  model  can  be  used  to  investigate 
optimal  survey  design,  should  it  be  possible  to  conduct  limited  sampling  in  denied  areas  via  covert 
means  (e.g.,  AUV’s). 


8 


40'  40'N 


73’  50'W  73'  4Q'W  73'  30'W  73'  20'W 


Figure  8.  Comparison  of  overlapping  portions  of  USGS  acoustic  backscatter  data  (a;  from  Schwab 
et  al.,  2000)  and  the  interpolated,  resampled  mean  grain  size  off  the  western  Long  Island  shelf  (b). 
Lighter  shades  indicate  in  (a)  higher  backscatter  and  coarser  sediment,  darker  shades  indicate  lower 
backscatter  and  finer  sediments.  Bathymetric  contours  on  both  plots,  from  the  NOAA  (2007)  coastal 

relief  model,  are  in  meters. 
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RELATED  PROJECTS 


This  work  is  not  presently  linked  to  any  other  programs,  but  could  prove  useful  to  ONR  programs  such 
as  the  Ripples  DRI  and  the  Shallow  Water  Acoustics  ’06  experiment,  which  will  make  use  of 
interpolated  point  data  related  to  seabed  properties. 
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